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Abstract 

Background: Clinical trial participants may be temporarily absent or withdraw from trials, leading to missing data. 
In intention-to-treat (ITT) analyses, several approaches are used for handling the missing information - complete 
case (CC) analysis, mixed-effects model (MM) analysis, last observation carried forward (LOCF) and multiple 
imputation (Ml). This report discusses the consequences of applying the CC, LOCF and Ml for the ITT analysis of 
published data (analysed using the MM method) from the Fracture Reduction Evaluation (FREE) trial. 

Methods: The FREE trial was a randomised, non-blinded study comparing balloon kyphoplasty with non-surgical 
care for the treatment of patients with acute painful vertebral fractures. Patients were randomised to treatment (1:1 
ratio), and stratified for gender, fracture aetiology, use of bisphosphonates and use of systemic steroids at the time 
of enrolment. Six outcome measures - Short-form 36 physical component summary (SF-36 PCS) scale, EuroQol 5- 
Dimension Questionnaire (EQ-5D), Roland-Morris Disability (RMD) score, back pain, number of days with restricted 
activity in last 2 weeks, and number of days in bed in last 2 weeks - were analysed using four methods for dealing 
with missing data: CC, LOCF, MM and Ml analyses. 

Results: There were no missing data in baseline covariates values, and only a few missing baseline values in 
outcome variables. The overall missing-response level increased during follow-up (1 month: 14.5%; 24 months: 
28%), corresponding to a mean of 19% missing data during the entire period. Overall patterns of missing response 
across time were similar for each treatment group. Almost half of all randomised patients were not available for a 
CC analysis, a maximum of 4% were not included in the LOCF analysis, and all randomised patients were included 
in the MM and Ml analyses. Improved estimates of treatment effect were observed with LOCF, MM and Ml 
compared with CC; only MM provided improved estimates across all six outcomes considered. 

Conclusions: The FREE trial results are robust as the alternative methods used for substituting missing data 
produced similar results. The MM method showed the highest statistical precision suggesting it is the most 
appropriate method to use for analysing the FREE trial data. 

Trial Registration: This trial is registered with ClinicalTrials.gov (number NCT0021 121 1). 
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Background 

The intention-to-treat (ITT) principle is one of the most 
important approaches used in performing statistical ana- 
lyses for randomised clinical trials [1]. It implies that for 
all patients randomised to a treatment, data should be 
analysed according to the treatment to which the patient 
was allocated, irrespective of whether they received this 
or some other treatment, or no treatment at all. Thus, 
patients randomised to a control arm who actually 
receive the treatment under investigation are regarded 
as 'controls' rather than 'treated cases'; in addition, 
patients who fail to comply with their assigned treat- 
ment are assessed as though the treatment was taken 
correctly [2]. The purpose of the ITT principle is to pre- 
serve the theoretical basis for the validity of the statisti- 
cal results, specifically by eliminating the possibility that 
patients with known or unknown prognostic factors are 
systematically selected to a treatment [1]. 

Nonetheless, despite being reported as early as the 
1960s [3], included in the International Conference on 
Harmonisation (ICH) Good Clinical Practice (GCP) 
guidelines [4], and being clearly described in methodolo- 
gical papers specifically targeted at an orthopaedic audi- 
ence [1], the ITT principle is not well known or 
uniformly used by orthopaedic investigators. Indeed, a 
recent systematic review (covering the years 2005-2008) 
showed that the ITT principle was adhered to in only 
96 out of 274 (35%) published orthopaedic randomised 
trials [5]. 

It is not uncommon for patients participating in clini- 
cal trials to be temporarily absent from follow-up visits 
or terminate their involvement. When this happens, 
some or all information on the treatment outcome for 
these patients will not be available for the statistical ana- 
lysis. Depending on the statistical method used, several 
approaches are available for handling the missing infor- 
mation in the ITT analysis. Either the available informa- 
tion can be analysed in its current form, or the missing 
data can be replaced with hypothetical observations 
(imputed data) to allow patients for whom some data 
are missing to be included in the analysis. The method 
used to adjust for missing data should be chosen appro- 
priately, depending on the reason for the missing data. 

By imputing data (i.e. filling in missing values with 
reasonable data), patients with some missing informa- 
tion can be included in an analysis. In fact, inclusion of 
patients with complete information only may not repre- 
sent a randomly selected subset of the overall rando- 
mized patient population. A complete-case (CC) analysis 
includes only those patients with both baseline and cor- 
responding follow-up outcome values and, as such, is at 
variance with the ITT principle (which requires that all 
cases, whether complete or incomplete, are included in 



the analysis) [2]. An alternative for including all patients 
is to perform a mixed-effects model (MM) analysis. By 
modelling both random and fixed effects, this technique 
can be used to analyse patients with incomplete follow- 
up. 

Missing data can be imputed via simple or multiple 
imputation methodology. One of the most frequently 
used simple imputation techniques is last observation 
carried forward (LOCF). In this method, the last 
observed values for a patient are used in place of the 
missing values; this method assumes that the last known 
state in the study represents the patient's true outcome 
[2]. This implicit assumption of representativity (which 
is present in all simple imputation techniques) may be 
considered a weakness, and should be questioned when 
the treatment effect changes during follow-up. Further- 
more, simple imputation techniques are deterministic 
and lead to an underestimation of the variability in the 
outcome. 

In contrast, multiple imputation (MI) is a stochastic 
technique which depends on model-based imputation of 
multiple values for each missing observation. The values 
are combined using the technique described by Rubin in 
1987 [6]. Thus, MI provides a superior alternative to 
simple imputation. It does not underestimate variability, 
and the models for imputation and efficacy analysis can 
be developed independently, thereby increasing the rea- 
lism of the assumptions underlying the imputation [2]. 

The advantages provided by the most recently devel- 
oped techniques, such as MM and MI (compared with 
CC and LOCF), are also associated with much greater 
complexity. As a consequence, widespread application of 
these techniques may be delayed because those unfami- 
liar with the methodology could have concerns about 
the validity of the results generated. Thus, further dis- 
cussion about the importance of the ITT principle, 
along with greater understanding of the problems asso- 
ciated with missing values in orthopaedic randomised 
trials, is required. 

The aim of this report is to describe and discuss the 
consequences of applying the three alternative methods 
(CC, LOCF and MI) for the ITT analysis of the Fracture 
Reduction Evaluation (FREE) trial - a randomized con- 
trolled trial of balloon kyphoplasty (BKP) versus non- 
surgical care for vertebral compression fracture, analysed 
using the MM method and recently published in The 
Lancet [7]. 

Materials and methods 

The Fracture Reduction Evaluation (FREE) trial 

Detailed methodology for the FREE trial has been pre- 
sented previously [7]. In brief, the FREE trial was a ran- 
domised, non-blinded trial comparing non-surgical care 
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alone with BKP for the treatment of patients with acute 
painful vertebral fractures. The study included patients 
from 21 sites in eight countries (Austria, Belgium, 
France, Germany, Italy, Sweden, the Netherlands and 
the United Kingdom) and was conducted from February 
2003 through December 2005. 

All participants had at least one acute thoracic or 
lumbar (T5-L5) vertebral fracture with bone marrow 
signal changes on magnetic resonance imaging (MRI), 
and vertebral height reduction (> 15% of predicted ver- 
tebral height) compared with the adjacent vertebrae. 
Painful vertebral fractures were diagnosed by the local 
investigator; up to three fractures could be treated if 
they also had signal changes, rapidly progressive height 
loss or pseudoarthrosis. 

Participants had self-assessed back pain of at least 4 
on a scale from 0 (no pain) to 10 (worst pain imagin- 
able) that started within the past 3 months and was not 
attributable to other causes. Vertebral fractures were 
included irrespective of aetiology; however, fractures due 
to primary bone tumours, osteoblastic metastases or 
high-energy trauma were excluded. Participants gave 
written informed consent before enrolment, and the 
protocol and consent forms were approved by local 
ethics committees. The trial was conducted in accor- 
dance with the Declaration of Helsinki, and is registered 
with ClinicalTrials.gov (number NCT00211211). 

Patients were randomly assigned in a 1:1 ratio to 
receive BKP or non-surgical care using a computer-gen- 
erated schedule. Study randomisation was stratified for 
gender, fracture aetiology, use of bisphosphonates at the 
time of enrolment and use of systemic steroids during 
the last 12 months before enrolment, but not for num- 
ber of prevalent fractures per participant. A permuted 
block randomisation (stratified as indicated) was gener- 
ated using PROC PLAN prior to the start of the study. 

Percutaneous BKP was performed with introducer 
tools, inflatable bone tamps, and polymethylmethacrylate 
bone cement and delivery devices (Medtronic Spine 
LLC, Sunnyvale, CA, USA) using a bilateral, transpedi- 
cular or extrapedicular approach. Patients received 
analgesics, bed rest, bracing, physiotherapy, rehabilita- 
tion programmes and walking aids according to the 
standard practices of participating physicians and hospi- 
tals. All patients were referred for treatment with cal- 
cium and vitamin D supplements, and antiresorptive or 
anabolic agents. 

The primary endpoint was the change from baseline 
to 1 month in quality of life (QoL) assessed using the 
Short-form 36 (SF-36) physical component summary 
(PCS) scale. Secondary endpoints included: EuroQol 5- 
Dimension Questionnaire (EQ-5D); SF-36 subscale 
scores; function measured using the Roland-Morris Dis- 
ability (RMD) score; back pain assessed with a visual- 



analogue scale (VAS; scale 0-10); limited days of activity 
and bed rest because of back pain during the previous 2 
weeks; and patient satisfaction assessed on a 20-point 
Likert scale (extremely dissatisfied to extremely satis- 
fied). Outcomes were assessed at baseline/screening and 
at 1, 3, 6, 12 and 24 months; back pain was also 
assessed at 7 days. 

Statistical methods 

Six outcome measures - SF-36 PCS scale, EQ-5D, RMD 
score, back pain, number of days with restricted activity 
in last 2 weeks and number of days in bed in last 2 
weeks - were analysed using four methods for dealing 
with missing data: CC analysis, simple imputation with 
LOCF analysis, MM analysis on all available data and 
MI analysis. 

The CC analysis included only patients with both 
baseline and all follow-up values for respective out- 
comes. For the LOCF analysis, only patients with avail- 
able baseline values were included; missing follow-up 
values were replaced by the patient's last observed value, 
based on the assumption that this represented the treat- 
ment effect. In contrast with the LOCF method, MI is a 
stochastic imputation method based on the assumption 
that missing values can be replaced with values gener- 
ated by a model incorporating random variation. The 
generation of such values is performed repeatedly pro- 
viding a series of complete datasets. These datasets are 
then analysed using standard methods for complete 
data, and the results are combined to provide a set of 
parameter estimates and their standard errors, from 
which confidence intervals and p-values can be derived. 
The MI model can be different from the model used for 
the final data analysis. In this study we imputed data 
using as-treated models, and analysed them according to 
ITT [2]. 

For the CC, LOCF and MI methods, the analysis was 
performed using a conventional repeated-measures 
ANOVA design. The model included treatment group 
and visit as fixed factors, as well as their interaction, 
together with covariates representing the randomisation 
stratification factors (gender, fracture aetiology, use of 
bisphosphonates at the time of enrolment and use of 
systemic steroids during the last 12 months before 
enrolment) and baseline values. 

In the MM analysis, all patients with at least one base- 
line or follow-up value were included. An MM analysis 
includes both fixed and random factors: in the current 
analysis, treatment group and visit were included as 
fixed factors, and patient was included as a random fac- 
tor. The model included interactions between treatment 
and visit. Randomisation stratification factors (gender, 
fracture aetiology, use of bisphosphonates at the time of 
enrolment and use of systemic steroids during the last 
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12 months before the time of enrolment) and baseline 
value were included as covariates. Compound symmetry 
structure for covariance between measurements was 
assumed. Maximum restricted likelihood procedure was 
used to fit the model and denominator degrees of free- 
dom were estimated using Satterthwaite's approxima- 
tion. This mixed model analysis is, with balanced data, 
equivalent to the conventional repeated measures 
ANOVA with sphericity assumption [8]. 
Assumptions 

The CC approach assumes that data are missing com- 
pletely at random (MCAR) - i.e. that missing data are a 
random subsample of all data. The LOCF approach 
assumes that baseline values are MCAR, and that there 
is no change in treatment effect during follow-up. It is 
also assumed that there is no variation in the measure- 
ment of treatment effect itself. In both the MM analysis 
and the MI technique, it is assumed that data are miss- 
ing at random (MAR), meaning that the probability that 
an observation is missing may depend on observed data 
but not on missing data. Thus, the likelihood function 
for the complete dataset, with respect to inference on 
the unknown parameters that characterise the complete 
data distribution, is the same as the likelihood function 
for the observed data. In the MM approach, the MAR 
assumption needs to be fulfilled in the analysis model, 
whereas in the MI approach only the imputation model 
needs to satisfy the MAR assumption. 
Imputation model 

In this study, the imputation model included all six out- 
comes at baseline and follow-up visits, all randomisation 
stratification factors, age, treatment centre and number 
of fractures at baseline (> 1), in addition to treatment 
'as received'. Treatment centres with less than 7% of 
patients overall were combined. Treatment 'as received', 
rather than treatment 'as randomised', was used in the 
imputation model to recover the 'true' outcome values 
for patients. This was done because the observed out- 
come values for the patient are influenced by the treat- 
ment received not the treatment assigned. 

Multiple imputation using fully conditional specifica- 
tion (or 'chained equations') [9] was used to create 30 
imputed datasets, primarily because the distribution of 
number of days with reduced activity and number of 
days in bed were substantially non-normal. Therefore, 
the variables were treated as ordinal in the imputation 
model. Other outcome variables were treated as contin- 
uous. PCS followed a normal distribution and was not 
transformed. Roland-Morris disability score, VAS and 
EQ-5D were transformed to normality using Box-Cox 
transformation and imputed with predictive mean 
matching method [10] to ensure that the imputed values 
did not exceed the natural range of values for those 
variables. After imputation the values were transformed 



back to the original scale. Other covariates were 
included as dummy (0-1) variables. To create the impu- 
tation in STATA the imputation by chained equations 
(ICE) implementation was used [11]. 

The MI approach provides the means to estimate the 
fraction of missing information relative to the complete 
information for each parameter of interest.([6], section 
3.3) If the missing data do not provide any additional 
information about the parameter of interest to that 
which can be observed from the available data then the 
fraction will be equal to zero. If the missing data contain 
a high proportion of information, and this is not also 
contained within the observed data, then the fraction of 
missing information will be high. 

Results 

Missing response 

There were no missing data in baseline covariates 
values, and only a few missing baseline values in out- 
come variables. The missing-response level increased 
during follow-up from an average of 14.5% at 1 month 
to 28% at 24 months; corresponding to a mean of 19% 
missing data during the entire follow-up period. Of the 
six outcome variables, back pain had the highest avail- 
able response level (83.4%) and RMD score had the low- 
est (77.2%; Table 1). Overall patterns of missing 
response across time were similar in both treatment 
groups, with consistently higher available response in 
the BKP group than in the control group (mean 9% 
higher for all outcome variables, with the exception of 
the RMD score where the difference between treatments 
was approximately 3%; Figure 1). 

Estimated fraction of missing information 

The fraction of missing information can be used instead 
of the traditional percentage of missing values to assess 
the level of missing response in surveys [12]. It does not 
need to be equal to the nominal amount of missing 
values, but can be smaller if there is information about 
missing data in other correlated and observed variables, 
or larger if patients with missing values have more influ- 
ence on regression coefficients than patients with 
observed values. The fraction of missing information 
and nominal percentage of missing values for each of 
the six outcomes are presented in Table 2. With the 
exception of 'days in bed', the fraction of missing infor- 
mation estimated using the MI method was lower than 
the nominal percentage of missing data per visit. 

Comparison of analysis methods 

Results for the analysis of variance with repeated mea- 
sures on the CC dataset, the dataset imputed with 
LOCF method, and datasets resulting from the MI pro- 
cedure and MM analysis on all available data are shown 
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Table 1 Percentage missing data in the FREE trial overall 

Follow-up (time after randomisation) 



Measure 


Baseline 


1 month 


3 months 


6 months 


12 months 


24 months 


Mean 


SF-36 PCS 


2.7 


13.0 


19.7 


21.0 


25.0 


27.3 


18.1 


EQ-5D 


2.3 


13.0 


19.3 


20.7 


24.7 


25.3 


17.6 


RMD score 


1.7 


15.7 


25.0 


26.7 


32.0 


36.0 


22.8 


Back pain 


0.7 


12.0 


18.0 


19.7 


24.7 


24.3 


16.6 


Restricted activity 


4.0 


18.0 


22.3 


22.0 


26.0 


28.3 


20.1 


Days in bed 


3.3 


15.3 


20.7 


20.7 


24.7 


27.0 


18.6 



Values shown are percentages 

EQ-5D, EuroQol 5-dimension; RMD, Roland-Morris Disability; SF-36 PCS, short form-36 physical component summary 



in Table 3. Complete-case analyses were performed on 
62%, 64%, 49%, 62%, 54% and 66% of patients for SF-36 
PCS, EQ-5D, RMD score, number of days in bed, num- 
ber of days with restricted activity, and back pain, 
respectively. Almost half of all randomised patients were 
not available for a CC analysis. A maximum of 4% of 
patients were not included in the LOCF analysis because 
of missing baseline values. All randomised patients were 
included in the MM and MI analyses. 

The consequences for these approaches are, inter alia, 
differences in size of standard errors of the estimates. 
The standard errors (SE) were compared with those 
from CC, which was used as the reference method. The 
mean squared errors (MSE) from CC, LOCF, MM and 
MI were calculated with means of the estimates from all 
four methods as reference values. The percentage reduc- 
tions in size of SE and MSE in estimates of treatment 
effect at five follow-up periods are presented in Figure 2 
and 3. 

The highest percentage reductions in standard errors 
at all follow-up months were observed with the LOCF 
analysis, about 20% for all outcomes apart from days in 
bed; MM showed higher (3% on average) percentage 
reductions in standard errors than MI at each follow-up 
month. However, it should be noted that LOCF has the 
potential of underestimating the true standard error, as 
it assumes no variation in the observations carried for- 
ward, so it may not be as reliable as the other two 
imputation methods. 

Percentage reductions in size of standard errors 
compared with CC for overall treatment effect are pre- 
sented in Figure 4. The reductions achieved when 
using the imputation methods (LOCF, MM, and MI) 
allow more precise estimates of the treatment effect to 
be generated for each outcome. For five of the out- 
comes, all three methods provided percentage reduc- 
tions in standard error compared with CC for overall 
treatment effect (Figure 4). However, for the outcome 
'days in bed' only the MM method provided a reduc- 
tion in size of standard errors, between 1% and 6% 
(Figures 2 and 4). A reduction was observed at most 



timepoints (months 1-12) with the MM method, 
whereas increases in standard error were observed 
across all follow-up timepoints (months 1-24) for 
LOCF and MI methods (Figure 2). 

These findings are supported by the comparisons of 
MSEs (Figure 3), which also show that the MM and MI 
approaches tend to yield more precise estimates, 
whereas the CC analysis tend to result in lower preci- 
sion, at least for other outcomes than "days in bed". 

Discussion 

The different analysis methods (CC, LOCF, MM and 
MI) applied to the FREE trial data produced similar 
results, with only minor differences in standard error 
sizes observed between the groups. This suggests that 
the FREE trial data are not dependent on the ITT analy- 
sis method used. Indeed, the information contained 
within the missing data would seem to be of a similar 
nature to the actual documented information. This 
implies that the conclusions made about the efficacy of 
BKP treatment in the FREE trial are robust. 

The literature shows that use of the ITT principle in 
orthopaedic randomized clinical trials (RCT) is limited; 
in a survey of eight leading orthopaedic journals only 
35% of RCT used the ITT method [5]. Thus, lack of 
ITT use could be a potential source of bias for a com- 
paratively large proportion of orthopaedic RCT. Similar 
investigations have previously been performed for the 
general medical literature: an analysis of four leading 
medical journals demonstrated that 119 out of 249 
(48%) RCT published in 1997 used the ITT principle 
[13], while a later analysis of ten medical journals 
showed that 249 out of 403 (62%) RCT published in 
2002 used the ITT method [14]. Therefore, while the 
ITT principle appears to be more commonly used in 
general medical RCT compared with orthopaedic RCT, 
there is still scope for improvement. 

In the FREE study, the missing response level was 
19% for the entire 24-month follow-up period. This 
proportion was similar to that observed for other long- 
term studies; in a survey of RCT published in eight 
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Figure 1 Missing data by treatment group in the Fracture Reduction Evaluation (FREE) trial. EQ-5D, EuroQol 5-dimension; RMD, Roland- 
Morris Disability; SF-36 PCS, short form-36 physical component summary. 



orthopaedic journals the mean rate of patients lost to 
follow up was 17% for thirty studies with a follow-up 
period longer than 1 year [5]. Indeed, increased missing 
response levels in RCT may be expected over longer 



time periods; in the same survey of orthopaedic RCT a 
significant increase in the proportion of patients with 
missing data was observed over longer follow-up peri- 
ods [5]. 
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Table 2 Fraction of missing information compared with 



the nominal amount of missing information 



Follow-up period 




1 month 


3 

months 


6 

months 


12 
months 


24 
months 


Outcome 


FMI 


MD 


FMI 


MD 


FMI 


MD 


FMI 


MD 


FMI MD 


SF-36 PCS 


11.9 


15.7 


15.8 


22.3 


17.2 


23.7 


23.9 


27.7 


30.1 30.0 


EQ-5D 


14.5 


15.3 


13.1 


21.7 


18.0 


23.0 


18.4 


27.0 


20.9 27.7 


RMD 


9.9 


17.3 


19.8 


26.7 


18.2 


28.3 


21.0 


33.7 


19.9 37.7 


Back pain 


12.3 


12.7 


15.7 


18.7 


19.0 


20.3 


17.3 


25.3 


29.6 25.0 


Restricted 
activity 


9.6 


22.0 


14.0 


26.3 


19.6 


26.0 


19.3 


30.0 


20.0 32.3 


Days in bed 


21.2 


18.7 


24.6 


24.0 


27.5 


24.0 


35.4 


28.0 


34.3 30.3 



Values shown are percentages 

EQ-5D, EuroQol 5-dimension; FMI, fraction of missing information; RMD, 
Roland-Morris Disability; SF-36 PCS, short form-36 physical component 
summary; MD, nominal amount of missing data in baseline and follow-up 
data 



While each imputation method provided similar 
results, there are certain characteristics related to each 
analysis method that need to be acknowledged. For 
example, the CC analysis (which is at variance with the 
ITT principle) is generally considered insufficient for 
evaluating data from clinical trials. Data can be missing 
in a sporadic manner across several different covariates 
and this can lead to the omission of a high proportion 
of patients for a CC analysis. Indeed, in the FREE trial 
almost half of all randomised patients were not available 
for the CC analysis. It is possible that only including 
patients with complete information may not provide a 
representative randomly selected subset of the total ran- 
domised population. The higher MSE for the CC analy- 
sis can also be interpreted as a sign of bias in the 
treatment effect estimate. However, in the FREE study 
the results obtained for the CC analysis were reasonably 
consistent with the MM, not being an imputation 
method, and MI suggesting those patients with complete 
information were representative of the randomised 
population. 

The Cochrane Musculoskeletal Group recommends 
that imputations based on methods such as LOCF are 
acceptable in both 'platinum' and 'gold' level publica- 
tions included in Cochrane systematic reviews [15]. 
Implicit in the LOCF method is the assumption that the 
outcome remains constant from the last observed value 
after drop out and that no measurement errors exist, 
otherwise there is a risk for bias particularly over longer 
follow-up periods. In the FREE trial treatment effects 
were actually observed to change during the follow-up 
period [7]. At 1 month follow-up, patients treated with 
BKP showed significantly (p < 0.0001) greater improve- 
ments in SF-36 PCS score compared with those who 
received non-surgical treatment. However, at 12-months 



follow-up the difference between the two groups had 
diminished likely due to fracture healing in patients who 
had received non-surgical treatment. Therefore, use of 
LOCF to analyze the FREE trial data could be a poten- 
tial source of bias. However, the present analysis showed 
that the LOCF method produced similar results to the 
MM and MI methods suggesting that any potential 
effects on bias would likely be minimal. 

The MI method does not have the disadvantages asso- 
ciated with single imputation methods: variance is not 
underestimated and treatment effect changes are pre- 
served over time. The MI method does rely to a great 
extent on the imputation model being correct, but pro- 
viding the data are normally distributed or can be trans- 
formed to normality there is a sound theoretical 
background to how the imputations are generated [16]. 
However, if some of the imputed variables cannot be 
treated as normally distributed then other MI methods, 
such as chained equations, need to be used. The chained 
equations method has produced accurate imputations in 
various settings [9], but the overall theory that proves 
its correctness is currently being developed. The inclu- 
sion of more variables in the imputation model than in 
the analysis model increases the probability that MAR 
assumptions hold, although this may lead to an increase 
in the standard errors of the final estimates [16]. In case 
the imputation model and the analysis models differ it is 
crucial to ensure that the models are congenial, i.e. that 
all variables used in the analysis model are included in 
the imputation model [17]. 

When surgical intervention in one group is compared 
with a non-surgical treatment in another, the propor- 
tions of patients lost to follow up could be expected to 
differ between the groups. The phenomenon has been 
described earlier [5], and it occurs also in this trial. The 
higher rate of missing data in the control group may 
undermine a MAR assumption, which would affect both 
the MM and CC analysis. However, if the MAR assump- 
tion holds for the analysis model used then MI should 
converge to MM as the number of imputations goes to 
infinity [18]. For the FREE data there were no substan- 
tial differences observed between the MM and MI ana- 
lyses despite the fact that the imputation model 
included more variables than the analysis model. This 
indicates that the MAR assumption holds for the analy- 
sis model and that the MM analysis method appears to 
be optimal. 

In the present analysis of the FREE data, the MM 
method showed the highest statistical precision without 
the unsubstantiated assumptions required for LOCF. 
This was particularly evident when the three methods 
(LOCF, MM and MI) were used to examine the percen- 
tage reduction in standard error compared with CC for 
overall treatment effect. Only MM provided a reduction 
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scale and restricted activity. CC, complete case; LOCF, last observation carried forward; Ml, multiple imputation; MM, mixed-effects model; SE, 
standard error. 
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in variance across all six outcomes measured (including 
'days in bed'). The MI method also provided low var- 
iance but was not as precise as the MM method across 
all outcomes considered. This suggests that application 
of the MM method is probably the most accurate 
approach for analysing these data. This suggestion is 
also supported by a simulation study [18]. 

Conclusions 

The FREE trial results are robust as the four alternative 
methods used for substituting missing data produced 
similar results. The small differences observed for the 
MM and MI models provide strong support for their 
use. By comparison the CC model included only 
approximately half of the ITT population and LOCF 



appeared to overestimate precision, which could poten- 
tially produce biased results. 
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